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Most previous contributions to BSDEs, and the related theories 
of nonlinear expectation and dynamic risk measures, have been in 
the framework of continuous time diffusions or jump diffusions. Us- 
ing solutions of BSDEs on spaces related to finite state, continuous 
time Markov chains, we develop a theory of nonlinear expectations in 
the spirit of [Dynamically consistent nonlinear evaluations and expec- 
tations (2005) Shandong Univ.]. We prove basic properties of these 
expectations and show their applications to dynamic risk measures on 
such spaces. In particular, we prove comparison theorems for scalar 
and vector valued solutions to BSDEs, and discuss arbitrage and risk 
measures in the scalar case. 

1. Introduction. Most previous contributions to backward stochastic dif- 
ferential equations (BSDEs), and the related theories of nonlinear expecta- 
tion and dynamic risk measures, have been in the framework of continuous 
time diffusions or jump diffusions. The setting has usually been scalar. In 
this paper a situation is considered where randomness is generated by a 
continuous time, finite state Markov chain. A corresponding discrete time 
theory is covered in the paper [5]. In this paper both scalar and vector com- 
parison theorems are proved in this framework. Following [15] this gives rise 
to a theory of nonlinear expectations. The proofs and results are different 
from those in the diffusion case. 

Consider a continuous time, finite state Markov chain X = {Xt,t G [0,T]}. 
Without loss of generality, we identify the states of this process with the 
standard unit vectors in where is the number of states of the 
chain. 
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We consider stochastic processes defined on the filtered probabihty space 
($7, {J^t}, IP) where {J-t} is the completed natural filtration generated by 
the c7-fields = cr({X„, n < i}, F € J't : F(i^) = 0), and T = Tt. Note that, 
as X is a right-continuous jump process with distinct jumps, this filtration 
is right continuous. If At denotes the rate matrix for X at time t, then this 
chain has the representation 

(1.1) Xj = Xo+ / A^X^^du^Mt, 

J]o,t] 

where Mt is a martingale (see Appendix B of [9]). As in [4], we know that 
the predictable quadratic covariation matrix of this martingale, {M,M)t, 
has the representation 

(1.2) {M,M)t= [ (diag(A,X„„)-^„diag(X„„)-diag(X„„)A:)dn. 

J]o,t] 

In an earlier paper, [4], we considered pairs, {Y,Z), Zt gM^^^, adapted 
and left continuous. If € adapted and cadlag which are solutions to 
equations of the form 

(1.3) Yt- [ F{uj,u,Yu-,Zu)du+ [ ZudM^ = Q. 

J]t,T] J]t,T] 

Here Q is an measurable, M^^ valued, P-square integrable random vari- 
able, and F is a map which is progressively 
measurable. 

Let ipt be the nonnegative definite matrix, 

(1.4) := diag(AfXf_) - yl* diag(Xf_) - diag(Xf_)A* 
and 

(1.5) \\Z\\j,^_:=TT{Zi;tZ*)- 

Then || • \\xt- defines a (stochastic) seminorm, with the property that 

(1.6) TT{Ztd{M,M)tZl) = \\Ztfx,. dt. 
We have shown the following result in [4]. 

Theorem 1.1. For Q € LP' {J-t), if F is Lipschitz continuous, in the 
sense that there exists c> such that, for any Z^jZ"^, Y^,Y'^ square inte- 
grable and of appropriate dimension, 

\\F{t,Y^_,Zl)-F{n,Ytl,Z^)\\<c{\\Zl-Znx._ + ||y/_ - y,^. ||) 
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dt X P-a.s., then there exists a solution {Y,Z) to (1.3), such that 



E 



\Yt\rdu 



o,Tl 



E 



0,T] 



tWxt^ 



du 



< +00, 



< +00, 



and this solution is the unique such solution, up to indistinguishability for 
Y and equality d{M,M)t x F-a.s. for Z. 

In this paper, we shall develop properties and applications of these solu- 
tions. 

Note that, where appropriate, we shall denote by <, >, etc. an inequality 
holding simultaneously on all components of a vector quantity. 



2. Extension to stopping times. We first extend the results of [4] to 
include the case where T, the terminal time of our BSDE, is an essentially 
bounded stopping time. By this we mean that T is a stopping time and there 
exists a real value T'"^^ such that P(r > T™^^) = 0. 

Assume that F(u,t,Yt-, Zt) is well defined and Lipschitz continuous dt x 
P-a.s. on [0,T™'^^]. Then from Theorem 1.1, there is a unique solution of the 
BSDE, 

(2.1) Yt- \ Iu<TFiu;,u,Yu-,Zu)du+ [ ZudMu = Q, 

for any Q G L'^^Ft'^^,^); this of course includes all Q G L'^{Tt), as F(r < 
2^max^ _ measurable Q, we evaluate (2.1) at T, and take an Ft 

conditional expectation. This gives 



Yt + E 



Zu dMu 



Ft 



Q 



P-a.s., and as M is a martingale we see Yt = Q P-a.s. It follows from this 
that jjj^^^max] ZudMu = P-a.s., hence Zt = 0, d{M,M)t x P almost surely, 
for all T < t < T™^^. Consequently, we can rewrite our BSDE as 



(2.2) 



Yt - [ F{uj,u,Yu-,Zu)du+ [ ZudMu 

J]tAT,T] J]tAT,T] 



Q, 



]tAT,T] 



which is a useful generalization of (1.3). 

We shall restrict ourselves to deterministic T in the following; however, it 
is clear that, given appropriate modifications, the results stated could easily 
be modified to remain valid when T is an essentially bounded stopping time. 



4 



S. N. COHEN AND R. J. ELLIOTT 



3. Basic theorems. Before developing specific applications of these process- 
es, we establish the following results. The methods used are based on those 
in [7], where similar results are proven for BSDEs on spaces related to Brow- 
nian motions. 

We shall henceforth assume that F is P-a.s. left continuous in t, is Lips- 
chitz continuous as in Theorem 1.1 and satisfies 



E 



[ \\F{u,t,Yt^,Z-, 

J]0,T] 



t)\\^dt 



< +00 



for all Y, Z bounded as in Theorem 1.1. Such a driver will be called standard. 
If also Q € L?'{Ft)-, then the pair {F,Q) will be called standard. 

We shall assume that the rate matrix A of our chain is left continuous, 
and there is an < e,- < 1 such that it satisfies 



(3.1) 



e*Atej e[er,l/er]U{0} 



dt-a.s., for all i and j, i j. This assumption is trivially satisfied if the chain 
X is time- homogenous, and essentially states that we shall not consider 
chains with positive transition rates unbounded or arbitrarily close to zero. 

3.1. Various lemmas. Throughout this section, 1 will denote a column 
vector of appropriate dimension with all components equal to one. 

We first note that, from the basic properties of stochastic integrals (see, 
e.g., [14], page 28), the following isometry holds. 



Lemma 3.1. For any predictable (matrix) process Z (of appropriate di- 
mension) any s <t, 









2- 






E 




[ Zu dMu 




= E 


u 



]s,t] 



Proof. Let (•, •) denote the predictable quadratic covariation. Expand- 
ing the norms in terms of traces and products, we wish to show 



ETr 



(3.2) 



Zu dMu 



s,t] 



ZudMu 



s,t] 



E 



TT[Zud{M,M)u{ZuY] 



If Jj^ Zu dMu is square integrable, then 



ZudM,. 



Zu dMu 



ZudMu 



Zu dMu 
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is a uniformly integrable martingale (see [14], page 38), and 



Zu dMu 



s,t] 



Zud{M,M)u{Zu) 



(by [14], page 48, Theorem 4.4). Combining these equations and taking a 
trace and an expectation gives the result in this case. 

If J^^^-^ZudMu is not square integrable, then, as both sides of (3.2) are 
nonnegative, they both equal +oo. The result follows. □ 

Lemma 3.2. For {F,Q) standard, ifY is the solution to (1.3) given by 
Theorem 1.1, then Y satisfies 

sup E[\\Ytf] <+oo. 

te[o,T] 



Proof. We have 

sup ^[11^*11^] = sup E 
te[o,T] 



te[o,T] 
<2E[\\Qf]+AE 



Q+ F{uj,u,Yu-,Zu)du- / ZudMu 

J]t,T] J\t.T] 

F{uj,u,Yu^,Zu)\[^ du 



]0,T] 



+ 4 sup E 

te[o,T] 



<2E[\\Qf]+AE 



ZudMu 



t,T] 



/ \\F{uj,u,Yu-,Zu)\\'^ du 
J]0,T] 



+ 4E 



\Z 



]0,T] 



uWXt- 



du 



As Q G L?'{J-t), F is standard and E[j^Q,j,^ \\Zu\\'x^_ du] is finite by the con- 
ditions of Theorem 1.1; this gives the desired inequality. □ 

Lemma 3.3. For all t, the matrix ipt is symmetric and bounded, and 
has all row and column sums equal to zero. The matrix given by its Moore- 
Penrose inverse, ip'^ , is also bounded and has all row and column sums equal 
to zero. 

Proof. As At is a rate matrix, A^l = 0. Hence, from (1.4), 
= diag{AtXt-)l - At diag(Xt_)l - diag(X<_)A*l 
= AtXt- - AtXt- - 
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that is, the rows of tpt ah sum to zero, and so ipt is singular. 

The symmetry of ipt is trivial, which implies the columns of ipt also all 
sum to zero. For clarity, an example of ipt is presented for iV = 4 in the 
Appendix. 

For any matrix with real components, the Moore-Penrose inverse exists. 
We first note that, if and only if CjAtXt^ = 0, the jth. row and column of ■i/'t 
will be zero. Also, if we exclude the row and column indicated by Xt- , "0* is a 
diagonal matrix. Therefore, if tpt has m rows and columns of zeros, the rank 
of tpt will be — 1 — m (unless m = A^, in which case it will be zero). Let ^l^t 
denote 'i/'t excluding the row and column corresponding to Xt^ and any rows 
and columns of zeros. Using the results of [6], Theorem 6.1, we can express 
i/jf as the inverse of ipt, pre- and post-multiplied by bounded matrices with 
row and column sums of zero, as appropriate. As ipt is a diagonal matrix, 
its inverse is bounded by (3.1). Hence tpf' is bounded, and has all row and 
column sums equal to zero. □ 

Lemma 3.4. At each time t, for all j such that e*AtXt- ^ 0, 



Furthermore, the vectors {ej — Aj_) form a basis for the range of the pro- 
jections, Tp^Tpt and ijjtipf . 

Proof. Again we use the results of [6], in particular Theorems 6.1, 7.1 
and 7.2. By Cohen, Elliott and Pearce [6], Theorem 6.1, we know that ipf is 
a true inverse for and {ipf)'^ is a true inverse for ijjf, within the class of 
matrices with both row and column sums zero and with rows and columns 
of zeros in the same rows and columns as ijjt- Therefore = (^^)^. The 
desired equation is then simply the result of [6], Theorem 7.1. The remaining 
statement is then simply an application of [6], Theorem 7.2. □ 

Lemma 3.5. For any standard driver F, for any Y and Z , up to indis- 
tinguishability, 



ipt'^tiej - Xt-) = iptipti^^j - ^t~) = ej - Xt-. 



F{uj, t, Yt-,Zt) = F{uj, t, Yt-,ZtiPt'^t) 



and 




Therefore, it is no loss of generality to assume that Z = Zipip' 
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Proof. Recall that, from the definition of the Moore-Penrose inverse, 
ipti^fipt = ^^1- Hence, for any t, P-a.s., 



TV([Z„ - ZMu] d{M, M)u[Zu - Z^^^^+Y) 



TV([Z, - Zui^ui^t]^t[Zu - ZuM^T) dt 



0,i] 

= 0. 

By the Lipschitz continuity of F this implies 



E 



\\F{uj, u, Yu-,Zu) - F{uj, u, Yu-,Zu'4>utpt)\\'^ du 



<c^E 



o,t] 



\Zu - Zutputpt\\% du 



0. 



As F is P-a.s. left continuous in t, and Yt- and Z are left continuous pro- 
cesses, we use the result of [8], Lemma 2.21, to show that 

F{uj, t, Yt- , Zt) = Fiu, t, Yt- , 



up to indistinguishability. 
By Lemma 3.1, 



E 



o,t] 



{Zu - Zuipuip^)dMu 



E 



o,t] 



\Zu — Zu'>pu'>Pu 



du 



and so for all t, P-a.s. 



o,t] 



{Zu-Zui^ui^+)dMu = o. 



As this process is a stochastic integral, it is cadlag (see [14], Theorem 4.31), 
and therefore we can again use [8], Lemma 2.21, to show the stochastic 
integrals are equal up to indistinguishability. □ 

Lemma 3.6. = only if Ztij^t'^t ~ 0- Hence without loss of 

generality, \\Zt\\xt^ =0 if and only if Zt = 0. 

Proof. We know that 

0=\\Zt\\x,.=TiiZti^tZn. 
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As tpt is nonnegative definite, there exists a matrix P such that ipt = PP* , 
the Cholesky decomposition of ipf Hence 

= TriZtiPtZn = Tr{ZtP{ZtPr), 
which is true if and only if ZfP = 0. Hence, without loss of generality, 

Zt = ZMt = ZtPP*^Pt =0. □ 
Lemma 3.7. For all t e[0,T], 

(3.3) ^|JtXt^ = -AtXt^ = - Y,{ejAtXt-){ej - Xt-). 

j 

Proof. From (1.4), 

^PtXt- = dmg{AtXt-)Xt- - Atdiag{Xt-)Xt- - diag{Xt-)AlXt-. 
As Xt^ is a standard basis vector in M^, we have 

i^tXt-=dmg{AtXt-)Xt- - AtXt- - {diag{Xt-)At)Xt-. 

Further, 

dia.g{AtXt-)Xt. = {X:_AtXt-)Xt. = (diag(X<_)A*)Xt-, 

therefore 

tlJtXt- = -AtXt-, 

establishing the first equality. 

We now note that, as l*^^ = 0, we have 

x:_AtXt.=- Yl i^j^tXt-) 



and hence 



AtXt^= ie*AtXt-)ej- ie*AtXt- 

j-.Xt-j^Bj j:Xt-y^ej 

= Y,i^*AtXt^){e,-Xt^). 



Xt 



□ 



Lemma 3.8. For processes Z solving (1-3), without loss of generality, 
the II • ||xt_ norm has two equivalent forms: 

\\Zt\\l^_=Tx{Zt^tz;) 

= Yi^*Aat-MZt{ei-Xt.)]\ 
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Proof. Consider the || • \\xt- norm of Zt- The trace can be calculated as 
Ti(Zt'ijjtZf) = "^j^e^ZtiptZ^ei, and therefore it is clear that we can consider 
each row e*Z separately from the others. Because of this, we need only to 
establish the result where Z is a single row vector. 

For any row vector of the form v = clear that 

diag{Xt-)v* =0 

and, therefore, 

\\v\\xt- = viptv* 

= v{dmg{AtXt-) - Atdiag{Xt.) - diag{Xt.)A*t)v* 

(3.4) 

= ( E Cje*]{diag{AtXt-))( ^ CjcA 
= Y: ie-AtX,.)c]. 

We shall use this "linearity" to establish the result. 

By Lemma 3.5, without loss of generality, we can write Zt = Zti^ti^f . We 
can therefore write Zt as a linear combination Zt = Ylj^ji^j ~ ^t-)* ■, by 
Lemmas 3.3 and 3.4. For simplicity, we define Cj = if Cj = Xt-. 

We define a vector of the form considered above, 

ej^Xt- ^ k 

(3.5) 

= E c^-s* + (E 1* - (E ^0 
j ^ k ^ fe ^ 

and as if^t is a matrix with column sums of zero, this and Lemma 3.4 imply 

ZMt = E ^J'S* - (E ^^■) ^t- = 

j j 
By Lemma 3.4, we then know 

Ztisj -Xt-) = ZtiJt^ti^j -Xt-) = Zt{ej - Xt.) 
and it follows from (3.5) that 

(3.6) Cj + ^ Cfc = Zt{ej - Xt-) = Zt{ej - Xt-) 

k 

for all Bj / Xt- . 
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As Z = Zt/jtipt^ and iptipftpt = '4't, it is clear from (1.5) that 



l^Wxt- - W^WX-, 



2 

t- 



□ 



Lemma 3.9. Without loss of generality, the row sums of Zt and Zfipt 
are all zero for all t. 

Proof. The row sums of Z can be written Zl. As ip and ip'^ are matrices 
with ah row and column sums equal to zero, ipl = ip'^l = which directly 
shows, using Lemma 3.5, Zl = Zt[j^^l = and Zipl = 0. □ 

Lemma 3.10. Consider a process Z sM'^^^ solving a standard, scalar, 
(K = 1) BSDE of the form of (1-3). Suppose that for a given t, Zt is such 
that \\Zt\\xt^ 7^ 0; (^^d, for some e, 

< e < e^/2Ar-3/2 

with £r as in (3.1), we know 

-e\\Zt\\x,. < {e*AtXt-)Zt{ej-Xt^) 

for all j . Then 

ZtiptXt^ < -£\\Zt\\xt-- 

Proof. We know from Lemma 3.8 that 

\\Zt\\l^_=TT{Zt^l^tZ:) 

= Y,{e*AtXt-)[Zt{e,-Xt^)]\ 
j 

By (3.1), for all Cj / Xt-, e*AtXt- G [Sr, l/sr] U {0}, for some Sr > 0, and 
so 

\\Ztfx,^< Yl e;\Zt{e,-Xt.)f 

{j:e*AtXt_>0} 

(3.7) 

<Ne-^ max [Zt{ej - Xt-)f . 

{j:e*AtXt->a} 
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Multiplying both sides by N^^Sr, this implies there must be an ej with 
ejAtXt^ > such that 

erN-^\\Zt\\%_ <[Zt{ej - Xt-)f. 
As e*AtXt- >er, we have 

^3/2^-1/2 11^^ ||^^_ < ^e*AtXt-)Zt{ej-Xt-). 
For all Cj such that {e*AtXt-)Zt{ej — Xt-) < 0, we have, by assumption, 

-eWZtWx, < {e*AtXt-)Ztie, - Xt-) < 0. 
By Lemma 3.7, we deduce 

Zt^tXt- = -ZtAtXt- 

= - ^{e*AtXt-)Zt{ej - Xt-) 



< -El'^N-^'^Ztfx^_ + {N- l)e\\Zt\\x,_ 

< -£\\Zt\\xt-, 

where the final inequality is from the assumption that e < e^^'^ N~^/'^ . □ 
3.2. Linear BSDEs. 

Theorem 3.11 (Linear BSDEs). Let {a, 13,^) be a du x F-a.s. bounded 
^^KxN ^^KxK ^-^K^ yQ^ly^gd predictable process, 4> a predictable M'^ valued 
process with E[J^qj,^ {{(ptW^ dt] < +00, T a deterministic terminal time and 

Q a square-integrable J-t measurable M'^ valued random variable. Then the 
linear BSDE given by 

(3.8) Yt- j [(Pu + PuYu- + auZl^u]du+ [ ZudM^ = Q 

J]t,T] J]t,T] 

has a unique square integrable solution {Y,Z) (up to appropriate sets of 
measure zero). Furthermore, if 

(3.9) I + asi^tiej-Xs-K 

is invertible for all j such that e*AsXs- > 0, except possibly on some evanes- 
cent set, then Y is given by the explicit formula 



TfQ+ [ T'lcPudu 

J]t.T] 



(3.10) Yt = E 

up to indistinguishability. Here Tf is the adjoint process defined for t < s <T 
by the forward linear SDE, 

(3.11) rt = I+ [ r'^"[^udu + aui'tdMu-/:]. 

J]t,s] 
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Before proving this, we establish the following results: 

Lemma 3.12. The adjoint process defined by (3.11) will satisfy, for t < 
r < s, 

Proof. Assume without loss of generality t < s <T. Write 
dVu = /3udu + auipt dM„7*. 
Then F is defined by the forward SDE, 

rt = i+ [ r^-dVu. 

J]t,s] 

If H is Tt measurable in M^^^, then 

L = HT 

is the unique solution to 

Ll = H+ [ Vl-dVu. 

Suppose t < r, and define 



Ff , for t <s <r, 

fort<r<s. 



Then 



for t < s <r, and 



Dt = I+ [ Dt-dVu 

J]t,s] 

Dt = ri+ [ DrdVu 

J]r,s] 

= 1+ [ D^dVu 

J]t,s] 

for t <r < s. By uniqueness, for t <r < s, this implies Df = Ff = F[F^,, as 
desired. □ 



We now discuss how solutions of first-order matrix differential equations 
can be expressed as product integrals. 
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Lemma 3.13. Consider a deterministic first-order differential equation 
of the form 



(3.12) Gi = I+ / C'-Hudu. 

J]s,t] 

Here G and H are K xK matrix-valued functions, H is bounded and Lehesgue 
integrahle and s <t. Then (3.12) has an invertible solution, which can be 
expressed as a product integral, denoted by 

Gi = U],^t]{I + Hudu). 

If Ht has nonnegative entries off the main diagonal for dt-almost allt, then 
this solution has all entries nonnegative. 

Proof. As H is bounded and Lebesgue integrable, we can appeal to 
the theory of the product integral. Using this, 

(3.13) Gl = U],^t]{I + Hudu)= limY\ll+ [ H^du], 
where, for each n, (sn) is an arbitrary partition of ]s,t] into n parts, 

S = (Sn)o < (Sn)l < (s„)2 < • ' • < (Sn)n = t, 

converging, as n — )■ oo, in the sense that 

hm max|(s„)j - (s„)i+i| = 0, 

n— >-oo I 

Y\ indicates products taken sequentially on the right and the limit is taken 
in the matrix norm Tr(G*(G*)*). This is called the product integral of H, 
exists by [13], Theorem 1 and solves the integral equation, (3.12) by [13], 
Theorem 5. 

Gg has an inverse, given by 

(3.14) (G*)-i=lim H^du] 

as noted in [12], page 134. 

Finally, if H has nonnegative entries off the main diagonal dt-a.e., as H is 
bounded, for sufficiently large n, I], ^ ^ i Hudu has all diagonal entries 
greater than —1 for all i. Therefore, 



/+ / Hudu 

is nonnegative for all i. Consequently, the product in (3.13) must be non- 
negative. As the set of matrices with nonnegative components is closed, the 
limit li^ g -\- Hudu) will also have all components nonnegative. □ 
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Lemma 3.14. Let t be a time at which X jumps, that is, AXt 7^ 0. Then 
F{AXt = ej - Xt^ for some j with e*AtXt^ > 0) = 1. 

Proof. Recall that ejAuCi is left continuous and lies in [£^,6"^] U {0}. 
If for some time u we have CjAuCi = 0, then there exist r <u< s such that 
ejAuCi = for all u G]r, s]. As A is left continuous and X is cadlag, A and 
X have countably many discontinuities, and therefore, for each ei,ej, there 
is a countable set of stopping times r„, Sn such that Xr„ = e, and 

e*Auei = for all u G ]r„, 

It is then clear that, for a given pair ei,ej, 

{u : e*AuXu~ = and = eJ = [J]r„, s„]. 



For a given n, let 



T = min{i > r„ : AXt 7^ 0} A s„. 



Then 



and hence 



E[Xr\J'rJ=Xr„+E 



AuXu- du 



E\e*AX^\ = E 



E 



e*AuXu- du 



0. 



It is clear that the left-hand side of this equation is positive if and only if 
P(AX,- = ej — ej) > 0. Hence we have shown that 

Y{AXt = Cj — Ci for some t € ]r„, s„]) = 0. 

Taking a sum over all i,j and the countable index n gives the desired 
result. □ 



Lemma 3.15. Suppose that for all s G]i,T], 

/ + a,^+(ej-X,_)7: 

is invertible for all j such that e*AsXs- > 0, except possibly on some evanes- 
cent set. Then the adjoint process defined by (3.11) is invertible (except 
possibly on this evanescent set). 
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Proof. By the definition of M in (1.1), we can rewrite (3.11) as 

J]t.s] 

(3.15) 



]t,s] 
t<u<s 



When AXu = 0, (3.15) is of the form of a classical, deterministic, linear 
matrix differential equation, with nonsingular solution T^^ given by the 
analogue of (3.13). Therefore, if two consecutive jump times are so,si, for 
s E [so, si[ we have 

Tt = rrn],„,,]{/ + [/3, - a^i;tA^X^.j:] du} = 

by Lemma 3.13. If is invertible, then 

(3.16) (r?)-^ = (rfj-i(rf)-i 

exists. 

At each jump time, (3.15) implies we have 



Hence 



r? = rf-)(/ + a,V'^Ax,7: 



By Lemma 3.14, at each jump time s, 

F{AXs = Cj - Xs- for some j with e*AsXs^ > 0) = 1. 

By assumption, I + asipf{ej — Xc,_)7* is invertible for all such j (up to 
evanescence), and 

(3.17) (rf)-i = {I + as^tAX,j:)-HTtY'. 

The process X almost surely has finitely many jumps in [0,T]. Through 
a process of induction using the starting value L^ = / and (3.16) and (3.17), 
we can conclude Tf is invertible (up to evanescence). □ 

Theorem 3.16. Suppose 13^ — auip^ A^Xu-'j* has all nonnegative com- 
ponents off the main diagonal P x du-a.s., for u E ]i, T], and, except possibly 
on some evanescent subset of x ]t,T], 

I + auipt{(ij - Xu-)l*u 

has nonnegative components for all j such that e*AuXu~ > 0. Then the 
adjoint process Tf has all entries nonnegative for all s E]t,r], up to evanes- 
cence. 
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Proof. As above, if two consecutive jump times are so,si, for s € [sq, si [ 
we have from (3.15), 

Tt = Tt°^so,s]{I + Wu - a^^Pt^uXu-i:] du}. 

By assumption, [I3u — au'4'i ^uXu-Ju] has nonnegative components off 
the main diagonal, and therefore by Lemma 3.13, T^^ has nonnegative com- 
ponents. The product of matrices with nonnegative components has nonneg- 
ative components, so if F^" has nonnegative components, has nonnegative 
components. 

At each jump time, we have from (3.15) that AFf = f|* ^agipt^Xsj*. 
This imphes 

and the term in parentheses has ah nonnegative components by our assump- 
tion and the argument of Lemma 3.15 regarding the values of AXs- 

The process X almost surely has finitely many jumps in [0,T], and there- 
fore through a process of induction using the starting value F* = /, we con- 
clude that F| has nonnegative entries up to evanescence. □ 

Corollary 3.17. The conditions of Theorem 3.16 are necessary for 
to have all entries nonnegative for all s and t. 

Proof. First consider the case when s is not a jump time. Then, as 
noted by [3], page 176, for small 6 > 0, 

TUs = I + Ws^s - assists As-sX^,_s).^:]5 + 0(6^). 

Hence if [f^ss — Ois~-s'4't-5^s-5X(s~S)~lu] negative components off the 
main diagonal, then so will F^_^. 

For s, a jump time, again for small 6 > 

Tl_s = I + asi;t^Ws+0{6). 

The result follows. □ 

A useful result when applying this lemma is the following. 

Lemma 3.18. For a nonnegative column vector x € and any basis 
vector Ci E M^, the matrix I + xe* has nonnegative components and is in- 
vertihle. 

Proof. Clearly, I -\-xe* has nonnegative components. For I + xe* to be 
singular, there must be a nontrivial linear combination of its rows equalling 
zero. Suppose yj denotes the jth row of / + xe* and '}2if=i '^jUj = 0- ^s. 
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for each j ^ i, yj is the only row containing a nonzero entry in the jth 
place, we must have Cj = for j 7^ i. Therefore we must have Ciin = 0. By 
assumption, x is nonnegative, so = (1 + e*x)ei / 0, therefore q = also. 
Therefore there is no nontrivial linear combination of the rows of / + xe^ , 
and so / + xe* is invertible. □ 

Lemma 3.19. Under the conditions of Theorem 3.11, 

sup£;[||r^f] <+cx) 



and 



sup£? 

t 



J\o,t\ 



du 



< +00. 



Proof. We first show sup^£^||ro|| < +00. From (3.11) 



E[\\Tir]<2\\ir + 2E 



T^-lPudu + aui^t dMull] 



2n 



<2||/f + 4 / E[\\Tl-f\E[W^f]du 
J]o.t] 



+ 4E 



The last term of the right-hand side can then be decomposed using Lemma 
3.1, to give 

iE 



1 


v^u— „ 

a 




= AE 


[/ 




U]o,t] 


<AE 


[/ 




U]0,t] 



l]0,t] 

We therefore have 



(7:7jTY((r^)*r^)TrKV'^V'.(V'^)*«:)d^ 

<4 / E[\\^^~f]E[\\J^f\\au^Pt\\l^_]du. 
J]o.t] 



E[\\rlf]<2\\lf + 4 [ i?[||rM|2]E[||/3J|2 + ||7,f||a,V'^llLJ'^^ 
J]o.t] 



and Gronwall's lemma yields 



^[||r*|p]<2||7||2exp{4i5;[/ ||/3„||2 + ||^„|j2||„„^4 
I U]o,t] 



du 
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As a, /3,7, tp and -0"*" are all du x P-a.s. bounded (by assumption and Lemma 3.3), 
the right-hand side of this is finite for all t. As it is also increasing in t, 



supE[\\rlf] <2||lf exp 

i 



I U]0,T] 



+ llTwlPlla^^^ llx„_ du 



< +00. 



To show the second statement, note 

2 



E 



Tq (j)udu 



<2E[\\rlf]E{\\Ytf]+2 [ E[\\r^-f]E 



[WcpuW^] du. 



Hence 



supE 
t 



rlYt+ [ r^-<Pudu 

J]o,t] 



<2(supE[\\rlf[ 



sup^[||yf| 
t 



+ 2 sup^[||r: 

^ II. 



u~ ||2- 

II - 



E 



\(t>u\\ du 



0,T] 



and each of these terms is finite. □ 



Proof of Theorem 3.11. It is clear that (3.8) is of the form of (1.3), 
where F{uj, t, Yt- ,Zt) = (j)t + PtYt- + o^tZtj^ is a Lipschitz continuous, square 
integrable driver for the equation. The uniqueness of the solution follows 
from Theorem 1.1. We now search for the closed- form solution. 

The first required result is that 



o,t] 



TQ(j)u du 



is a uniformly integrable martingale. To see this, observe that in this context 
all integrals are Stieltjes integrals, and hence 

r'oYt = TlYo+ [ [T]^~dYu + dT^Yu-]+ V Ar^AF, 

= r°Yo- [ r^-[^u + f3uYu- + auZ:ju\du+ [ r^z„dM„ 

J]0,t] J]0,t] 

+ [ r^-i3uYu-du+ [ rra,^+dM„7:y„_ 

J]0,t] J]0,t] 

+ r^"a„V^AM„7:Z„AM„. 

0<u<t 
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Then, as a 1 x 1 matrix is its own transpose, 

The quantity AM^AM* is equal to d[AI,M]u, the measure induced by 
the optional quadratic covariation matrix of M. d{M,M)u is then the dual 
predictable projection of d[M,M]u (see [8]). Therefore, 



. du 



/ Tq (f)u du = Lt- Tq auZ*ju < 
J]o,t] J]o,t] 



for some local martingale L. It follows that 



du 



and from Lemmas 3.3 and 3.5, tp^ipuZ* = Z* without loss of generality, so 
the latter of these terms is zero, and, therefore, the left-hand side is a local 
martingale. 

By Lemma 3.19, TQYt + J^^^ ^TQ(j)y,du is square integrable, and hence is 
a uniformly integrable martingale (see [14], page 12). Hence it is indistin- 
guishable from the conditional expectation of its terminal value (see [14], 
page 11) 



J]0,i] L J\0,T\ 



Through standard calculations and the use of Lemmas 3.12 and 3.15, we can 
conclude that, up to indistinguishability. 



Yi = E 



rfQ+ [ ri(t>udu 

J]t,T] 



□ 



Corollary 3.20. If Q and (j) are nonnegative, and the assumptions of 
Theorem 3.16 are satisfied, then Y is nonnegative. If, in addition 10 = 0, 
then, Y = up to indistinguishability (and hence Q = 0, ¥-a.s.) and (pt = 0, 
¥ X dt-a.s. 



Proof. This follows from Theorem 3.11 and the nonnegativity result 
of Theorem 3.16. The strict comparison is trivial, given the invertibility of 
r* for all s and t and the fact that Y is cadlag. □ 
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4. A scalar comparison theorem. We shall now establish a comparison 
theorem relating the solutions of two BSDEs in the scalar case. 

Remark 4.1. In the scalar case, when K = 1, the assumptions of The- 
orem 3.16 simplify considerably. The assumption of nonnegativity of 

off the main diagonal becomes trivial (as there are no off diagonal terms) 
and the assumption that 

is P-a.s. invertible and nonnegative for all j such that ejAgXg- > can be 
simplified to 

(4.1) a,V^(e,-X,_)>-l 
as 7 = 1, without loss of generality. 

Theorem 4.2 (Scalar comparison theorem). Suppose we have two stan- 
dard, scalar (K = \), BSDEs corresponding to coefficients and terminal val- 
ues {F^,Q^) and {F'^,Q'^). Let (Y^,Z^) and {Y^,Z^) be the associated so- 
lutions. We suppose the following conditions hold: 

(i) > P-a.s.; 

(ii) dt X P-a.s., 

{u, t, Yl , Z2) >F\u;, t, , Z^); 

(iii) there exists an e > such that F-a.s., for all t £ [0,T], if , Zf are 
such that 

{e*AtXt-)[Z} - Z'l\(ej - X^.) > -e\\Z} - Z^\\x,_ 

for all Cj, then 

F\u,t,Yl,Z})-F\uj,t,Yl,Z^) > -[Zj - Z^MtXt- 

with equality only if \\Z^ — Zf\\xt_ =0. 

It is then true that Y^ > Y'^ P-a.s. Moreover, this comparison is strict, 
that is, if on some A G Tt we have Y^ = Y^ , then = P-a.s. on A, 
F'^{u,s,Y^,Z'^) = F^{uj,s,Y^,Z^) dsx¥-a.s. on [t,T] xA andY^ is indis- 
tinguishable from on \t,T] x A. 

Remark 4.3. Note that assumption (iii) need only hold for Z^ and Z"^; 
there may well be other processes Z for which it fails. This will correspond 
with various types of dominance, as will be seen in Section 7 below. In 
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practice, however, it may be more convenient to assume that this assumption 
holds for all and Z^, as it is not clear how one might show this condition 
holds for a particular (F, Q) without first finding the solution processes 
(y, Z) analytically. 

An intuitive interpretation of this assumption is, "consider the difference 
between the SDEs with starting value y^^, trend F(a;, i. It, Z^*), and hedging 
processes Z\ and Z^. If the only sizeable jumps that can occur in this differ- 
ence are positive, then the overall trend through time should be negative." 



Proof of Theorem 4.2. We can write 



(4.2) 



yt-yt- [ [F\u;,u,Y^_,Zl) - F\uj,u,Y^_,Zl)]du 

J]t,T] 

+ [ [Zl-Zl]dMu = Qi-Q2. 

J]t,T] 



Taking (4.2), the equation satisfied by 6Y := Y^ - Y^ and 6Z := Z^ - Z^, 
we shall form an equivalent linear BSDE for 6Y and apply Corollary 3.20 
to prove the desired result. For notational simplicity, we shall omit the uj,t 
arguments of F^, F"^ as implicit. We also define 0/0 :=0 wherever needed. 

Without loss of generality, we can assume 



e<e\ 



3/2^-3/2 



where is as in (3.1). 
We consider three cases. 

1. If F^{Y^_,Zl) - F^{Yt^_,Z^) > 0, then let 

<Pt = F\Yl,Zl)-F\Yl,Zf), 
_ F\Yl,Zl)-F\Yl,Zl) 

at = 0, 
It = 1- 

Note that assumption (ii) and the fact F'^{Yl_,Zl) - F'^{Y^_,Zl) > 
implies that (/)t > 0. 

2. If F^Y^^_,Z^) - F\Yl^,Zf) < and there is a j such that 



''J 

then let 



e*AtXt-)[Zl - Zj]{e, - Xt_) < -e\\Zl - Z?\\x,. 
Pt = F\Yl,Z^)-F^iYl,Z?), 
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FHYl,Zl)-FHYl,Zl 



FHYl,Zi)-FHYl,Zf) 

= [Zl-Z?]^,e, 

It = 1- 

3. U F\Yl,Zl) - F'{Yl,Zf) <0 s.nd 

(4.3) {e*AtXt_)[Zl - Z^]iej - X^.) > -e\\Zl - Z^\\x,_ 

for all j, then let 

cPt = FHYl,Z?)-F\Yl,Z^), 
^ _ F\Yl,Zj)-F\Yl,Zl) 

F\Yl,Zl)-F\Yl,Z^) 

[Zt - ztmXt- 

it = 1- 

In all three cases, it is clear that 

F\Yl,Zj) - F\Yl,Z?) = ^t+ myt-) + MSZtTit, 

and so the linear BSDE with these values of (pt, A, at and "ft is equivalent 
to (4.2). 

Furthermore, -©[jj^^j < +oo as F is standard, and in each case, 

A) Oit and 7j are dt x P-a.s. bounded. This is trivial for 7^ and follows directly 
from Lipschitz continuity for (3t in all cases. 

In case 1, at = 0. In case 2, we know that ipt^j = {e*jAtXt-)[ej — Xt-\ 
by the definition of "0 in (1-4). By assumption, the absolute value of {Z} — 
Z^]tptej is then at least e\\Zl — Z^^||xt„, and therefore at is bounded by 
Lipschitz continuity. 

In case 3, by Lemma 3.6 and the fact F^(Y^,Zl) ^ F^ {Y-l^_ , Zf) , we know 
\\Z^ — Z^\\xt_ 7^ 0, and so we are in precisely the situation considered in Lem- 
ma 3.10. Therefore [Zj — Zf]iptXt- < —£\\Zf — Zf\\xi_-, and so Lipschitz 
continuity implies the boundedness of at in case 3. 

We have assumed that F^ and F"^ are standard, and hence P-a.s. is left 
continuous in t, as are Y^_ and Zl for i = 1,2. By construction, we see that 
the variable Ct indicating which of cases 1, 2 and 3 is in force at any time 
t is predictable. Furthermore, the processes a, (3 and (f) defined in each case 
are also predictable. We then use Ct to piece together the various definitions 
of a, (3 and (j) to give a single predictable linear BSDE, with the same driver 



BSDES ON MARKOV CHAINS 



23 



values as F^{Ytl^,Zl) - F'^{Y^_, Z"^), dt x P-a.s. This linear BSDE satisfies 
all the requirements for Theorem 3.11. 

We now appeal to Remark 4.1 to determine that the only requirement to 
show nonnegativity of 5Y is that, for all t, atipf{ek — Xt-) > — 1 for all k 
with elAt-Xt- > 0. 

In case 1 this is clear. In case 2, we can write 

amiek -Xt_) = e*iJt[Zl - Zf]* ^i^*^* (^^ - Xt-), 

and, by Lemma 3.4, e*iljt4''^ {ek — Xt-) = e*{ek — Xt-) which is zero unless 
k = j. If /c = j we see that 

, + , ^ , F'iYl,Zl)-F\Yl,Zi) 
atWlGk — Xf- = — 1 > U 

by construction. 

In case 3, we know from Lemma 3.10 that [Z^ — Z^]'i/jtXt- is negative. 
For any k satisfying Lemma 3.4, this implies 

iJti^t {ek - Xt- ) (efc - Xt- ) 



xt_Uzl-z^Y xt_Mzl-Z-^ 



is positive. We then use assumption (iii) of the theorem, along with Lemma 
3.7, to show that 

atiptisk - Xt-) 

_ F'{u,t,Yl,Zl)-FHoo,t,Yl,Zi) ^ 

[Zt - ZfliptXt- 

[Zl-ZMtXt- 

> ~T7i -721 ; V ^t-[(^k - Xt-) 

[Zt - Zfl^tXt- 

[Zl - Zf]i;tXt- 
[Zl - Zl\^tXt- 
= -1 

as desired. 

Therefore, we have shown that SY = Y^ — Y^; the difference of our pro- 
cesses satisfies the requirements of Remark 4.1. That is, the assumptions of 
Corollary 3.20 are satisfied by this process and 5Y is therefore nonnegative 
P-a.s. The rest of the theorem also follows by Corollary 3.20. □ 

Remark 4.4. In general assumption (iii) cannot be omitted, and is 
closely related to various geometric interpretations of no arbitrage (see Sec- 
tion 7). The fact that it is possible to create such Z^ and (due to the lin- 
ear redundancy in M), indicates a significant difference between the Markov 
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chain theory considered here and that based on Brownian motion considered 
elsewhere. 

4.1. Scalar counterexamples. Suppose the theorem were to hold without 
assumption (iii). Then we can create contradictions as follows. 

Example 4.1. Take any standard F such that there is Y^, Y'^, and 
Z'^ with Yfl = Y^, [Zl - Zf]{ej - Xt-) nonnegative for all Cj with e*AtXt- > 
and positive for at least one such j, and 

F{u,t,Yl,Z})-F{uj,t,Yl,Z^) < -[Z^-ZMtXt- 

on some (stochastic) set of positive dt measure. Without loss of generality, 
we shall assume that this set is the stochastic interval [0,5], where S is an 
essentially bounded stopping time. Extend F to [0, T] for some deterministic 
time T as in Section 2. 

Then define and by the forward stochastic differential equations 



Yt' -I Fioj, u, Y^_ , Zl) du+ [ Zl dM^ = , 
Yt^-f F{u:,u,Y^_,Zl)du+ I ZldM^ = Q\ 



]t,T] J]t,T] 

If we consider the process 6Y ■.= Y^ — Y'^, we can see that 6Y is nonde- 
creasing over [0,r] with probability one, and is increasing with positive 
probability. This is because at every jump time s, 

A{5Y), = [Zl - Z^,]AXs = [Zl - Z^]iej - 

for some j which is nonnegative a.s. and positive with positive probability, 
and at every other time, the derivative of 6Y is equal to 

= -Fiu;, u, Y^_,Zl) + Ficv, u, Y^^Zl) 

— zIAuXu- + Z^AuXu- 
>0 

by assumption. Therefore > P-a.s., and > with positive prob- 
ability. However, the initial value, 5Yq = 0, would imply, by the strict com- 
parison of the theorem, that = P-a.s., which is a contradiction. 

Example 4.2. For a given Markov chain, X, with three states where 
the rate matrix is such that the jump rate between any pair of states is 
nonvanishing, define the process {Jt} which counts the number of jumps in 
X strictly prior to time t. Let 



Zl = [2Vl-2-2(^*+2)e* - 2-(-^'+i)e* 



BSDES ON MARKOV CHAINS 



25 



where e^^, e^j and Xt- are the three basis vectors in M^. Let = and 
F^ = F^ = F{uj,t,Yt.,Zt) = -\\Zt\\x,^ - ZtAtXt-. 

This particular combination for Z^ — always has the possibility of 
negative jumps, however, these decrease exponentially in magnitude while 
II — Z'^Wxt^ = 4 is constant. For Jt sufficiently large, this will fail to satisfy 
assumption (iii) for any fixed e. The sizes of the negative jumps are such 
that their sum will always be more than —1. 

Consider the forward SDEs defined by these Z^ and Z'^ with starting 
value = = 0; for terminal time T = 1, = and > 3 P-a.s., 
contradicting the theorem. 

Remark 4.5. These examples also demonstrate the interrelationship 
between the scalar comparison theorem, assumption (iii) and dominance. 
This is explored further in Section 7. 

5. General comparison theorems. We now wish to extend Theorem 4.2 
to the vector case. This is nontrivial, as the simplifications of Remark 4.1 are 
not possible, and we must satisfy the more difficult conditions of Corollary 
3.20 directly. We also do not have the useful result of Lemma 3.10. We 
present here some alternative generalizations. 

Theorem 5.1 (Vector comparison Theorem 1). Suppose we have two 
standard BSDE parameters, (F^Q^) and {F^,Q^).Let {Y^,Z^) and {Y'^,Z^) 
be the associated solutions. 

Recall that a BSDE here has F{u, t, Yt-, Zt),Yt-,Q G and Zt G M^^^ 
where N is the number of states of the Markov chain X . 

We suppose the following conditions hold: 

(i) > P-a.s.; 

(ii) dt X P-a.s., 

F^ {uj, t, Yl , Z2) >F\lo, t, Yl ,Z^y, 

(iii) there exists an e > such that P-a.s., for all t G [0,T], for any basis 
vector efe G , if Z} , Z^ are such that 

{e]AtXt.)el[Zl - Z^]{e, - Xt-) > -e\\el[Zl - Zj]\\x,_ 

for all Cj, then 

el[F\u;,t,Yl,Zl) - F\u;,t,Yl,Z^)] > -el[Z} - Z^]AtXt^ 
with equality only if ||e^[Z/ — Zf]\\xt_ =0; 
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(iv) for each i, e*F^ can be written as a function 

e*F' iLo,t,Yt^,Zt) = Fl{u,t, e*Yt^ , e* Zt), 

that is, the ith component of F^ depends only on the ith component of Y 
and the ith row of Z (and to and t ); and similarly for F"^ . 

It is then true that Y^ > Y'^ P-a.s. Moreover, this comparison is strict, 
that is, if on some A G Ft, we have Y^ = Y^ , then = P-a.s., on A, 
F'^{uj,s,Y^,Z^) = F'^{uj,s,Y^,Zl) ds x P-a.s. on [t,T] x A and and Y^ is 
indistinguishable from Y"^ on \t,T\ x A. 

Proof. This can be established using the results of Theorem 4.2. From 
(1.3), we know that, with 5Y := Y^ - Y^, bZ := Z^ - Z^, 

Q^-Q'^=bZt - [ [F\u,u,Zl_,Y^) - F^{uj,u,Zl_,Y^)]du 

J]t,T] 

+ / 5YudMu, 

J]t,T] 

which implies, for each i, 

= e*6Yt- [ e*[F'{u:,u,Y^_,Zi)-F\Lo,u,Y^_,Zl)]du 

J]t,T] 

+ I e*5ZudMu 

J]t,T] 

= e*5Yt - [ [Fl (^, u, e\Yl_,e\Zl) - F^{u;, u, e*Y^_,e*Zl)] du 

J]t,T] 

+ [ e*5Z^dMu, 

J]t,T] 

which is of the form considered in Theorem 4.2. As all the assumptions 
needed in the scalar case are given, the result is shown for each component 
e*6Y by Theorem 4.2. As this holds for all i, the vector result follows. □ 

Corollary 5.2. For any pair of BSDEs satisfying assumption (iv) of 
Theorem 5.1, if assumptions (i)~(iii) are satisfied by any component of the 
terminal condition and driver, then the comparison theorem will hold on that 
component. 

Proof. This is a direct result of applying Theorem 4.2 to the component 
in question. □ 
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It is clear that this theorem does not provide for much more behavior 
than the scalar case, as it is considering each component separately from 
the others. Removing this requirement is nontrivial, but can be done. To do 
so using the machinery of the solutions to linear BSDEs, we must make the 
following, in some sense stronger, assumption. 

Theorem 5.3 (Vector comparison Theorem 2). Suppose we have two 
standard BSDE parameters {F^,Q^) and {F^,Q^). Let (Y^,Z^) and {Y'^,Z^) 
be the associated solutions where the dimensions of F , Q, Y and Z are as 
in Theorem 5.1. We suppose the following conditions hold: 

(i) > F-a.s.; 

(ii) dt X F-a.s., 

F' (lo, t, Yl , >F\u, t, Yl ,Zl)- 

(iii) there exists an e > such that, ¥-a.s., for all t € [0,T], for any basis 
vector Cj € , if 

{elAtXt.)e][Zl - Zfjie^ - Xt.) > -e\\Zl - Zf\\x,_ 

for all gfc 7^ Xt^ , then 

e* [F' (cu, t, Yl ,Zl)-F\u;,t, yI , Zl)\ > 0; 

(iv) for each i, e*F^ can be written as a function 

e*F\uj, t, Yt-,Zt) = Fl{u, t, e*Yt.,Zt), 

that is, the ith component of F^ depends only on the ith component of Y 
(and on uj,t and Zt). 

It is then true that Y^ > Y'^ P-a.s. Moreover, this comparison is strict, 
that is, if on some A G Ft we have Y^ = Y^ , then = F-a.s. on A, 
F^{uj,s,YlZ'^) = F'^{uj,s,YlZ'^) dsxF-a.s. on [t,T]xA, andY^ is indis- 
tinguishable from Y"^ on [t,T] x A. 

Remark 5.4. The key differences between Theorems 5.3 and 5.1 are 
that in Theorem 5.3, a weaker assumption is placed on the behavior of the 
driver F^ in relation to interactions between the rows of the Z matrix, but 
a stronger assumption is placed on the behavior of a component of F^ when 
no jump is significantly negative in that component. We shall also see that 
this weaker assumption leads to a significantly weaker result — the strict 
comparison cannot here be shown to hold componentwise. 



28 S. N. COHEN AND R. J. ELLIOTT 

Remark 5.5. Assumption (iii) in Theorem 5.3 is a stronger assumption 
than for Theorem 5.1. In the scalar case, which is the foundation of Theo- 
rem 5.1, we used Lemma 3.10 to show that each row, e*[Zl — Zf]tptXt-, is 
negative, and our assumption was equivalent to 

el[F\u;,t,Yl,Z^) - F\u;,t,Yl,Z?)]>el[Zl - Z^]ijtXt., 

which is clearly satisfied when 

el [F' iiv,t, Yl ,Z\)- F' (a;, t, yI ,Z?)]>0. 

Proof of Theorem 5.3. For simplicity of notation, again let 6Yt = 
Y^ — Y^, 6Zt = Zl — We also will omit the w and t arguments of F 
as implicit. Note that here (/>,7 € M-^, /3 e ^^"^^ and a £ M^^^. We wih 
construct a, (3, 7 and (f) in two stages. 

1. Consider first those components of F^{Y^,Z^) — F^{Y^_, Z"^) which are 
nonnegative. Let Cj indicate each such component in turn. Then define 
the corresponding components of (j)t and /3(, and rows of a^, by 

e*<Pt = e*[F\Yl,Zl) - F\yI,Z^)] > 0, 




e*at = 0. 

It follows that E[f^Qj,^ \\e* (ptW"^ dt] < +00 as F^,F'^ are standard, e*/3j is 

dt X P-a.s. bounded by Lipschitz continuity of F^, and they will satisfy 
the requirements of Corollary 3.20. 
2. If all components are dealt with by the above, then we have completed 
the construction. Otherwise, for each time point t, let jt indicate the 
smallest of the remaining components of 6Yt- in absolute value, that is, 
7t is the basis vector such that |7^*(5Y4_| < |e*(5Yj_| for all j. 
For each of the remaining components, we now define 

e*cPt = e* [F' (Yl ,Z?)- F\yI , Z?)] > 0, 

eliFHYl^Zj) - F^{Yl,Zl)]- Bjtm^] , , ^ , 
e. A = + B,, 

for a large, fixed number B > ce~^e~^, where c is the Lipschitz constant 
of F^, e is as in assumption (iii) of the theorem, and £r is as in (3.1). As 
7t indicates the smallest component in absolute value of Yt„, Lipschitz 
continuity again guarantees that this will remain bounded. 
By assumption (iii) of the theorem, recalling 

Tptek = {ekAtXt-)[ek - Xt-], 
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either for some ^ Xt 



(5.1) 



t\\Xt- 



or 



^nPHYt-^Zl) - F\Yl,Z^)]>0. 
However, we have selected 74 such that 
(5.2) jnFHYl,Z^)-F\Yl,Z?)]<0 
and therefore (5.1) must hold. Define 

'enFHYl,Zl)-F\Yl,Zf)] 



fij at 



j^[SZt]iptek 



This is dt x P-a.s. bounded by Lipschitz continuity and (5.1). As e^jpt'ipf {cj 
Xt^) > for ejjCfc 7^ Xt^, and the fraction in parentheses is positive [by 
(5.1)] we have that 

I + akiP+{e,-Xt^)^* 

is an invertible matrix with nonnegative components by Lemmas 3.18 
and 3.4. 

For the rows defined in the first case, we have /Sj only having elements 
on the main diagonal, and at having a corresponding row of zeros. Hence 
Pt — atipf AtXt-Jt has only entries on the main diagonal. For the rows 
defined in the second case, we have, for each row i, 



\e*at'4)t AtXt. 



< 



et[F\Yl,Zl)-F\Yl,Z^)] 
-il[5Zt]ilJtek 
el[F\Yl,Zl)-F\Yl,Zf)] 



elMt^tXt- 



-f^[5Zt]tptek 



< B. 



Therefore the ith row of f^t — atipf AtXt-Jt has an entry on the main di- 
agonal, an entry of B — e^atipf AtXt^ > in the 74 column (when this is 
off the main diagonal), and zeros elsewhere. Hence fit — atipf AtXt-'^l has 
nonnegative entries off the main diagonal. 

We therefore have defined, for each time i, random variables (pt, (3t, at 
such that 



F\Yl,Z}) - F\Yl,Z^) = cPt + lit5Yt- + atdZ^^it. 
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As in the scalar case, we can now piece together these /3f, at and 7t 
to give predictable processes, <j), f3, a and 7. We have established that /?, a 
and 7 are dt x P-a.s. bounded. As F is standard, (p is also square integrable, 
and is nonnegative by construction. 

Therefore, up to the desired dt x P-a.s. level of certainty, the requirements 
of Corollary 3.20 are satisfied for the linear BSDE solved by 5Y. The result 
follows. □ 

Remark 5.6. This proof is significantly weaker than that of Theorem 
5.1. In particular. Corollary 5.2 has not been established in this case. It would 
appear intuitively reasonable, from a geometric perspective, that it should 
hold, and also that the weaker conditions of Theorem 5.1 assumption (iii) 
should be sufficient. However, the machinery of appealing to the solutions 
of linear BSDEs does not appear to be adequate to prove these results. 

The assumption that the ith. component of F can depend only on the 
ith component of Y may be overly restrictive. Because of this, we have the 
following alternative generalization, where we instead assume that F does 
not depend on Z. 

Theorem 5.7 (Vector comparison Theorem 3). Suppose we have two 
standard BSDE parameters {F^,Q^) and {F^,Q^).Let iY^,Z^) and iY^,Z^) 
be the associated solutions. We suppose the following conditions hold: 

(i) > P-a.s.; 

(ii) dt X P-a.s., 

F^ {u, t, Yl , Zl) >F\u;, t, yI ,Z^y, 

(iii) there exists an e > such that, dt x P-a.s., for each i, if 

e*F' {oj, t, Yl , Zt) < e*F' {cj, t, Yl , Zt), 

then either 

\e*[Yl-Yl]\>e\\Yl-Yl\\ 

or there is a j with 

e*[Yl-Yl]<-e\\Yl-Yl\\; 

(iv) F^ does not depend on Z (or equivalently, by Lemma 3.9, F^ depends 
only on the row sums of Z , not on its individual elements). 

It is then true that Y^ > Y^ P-a.s. Moreover, this comparison is strict, 
that is, if on some A G J^t we have Y^ = Yl then = P-a.s. on A, 
F^{uj,s,YlZ^) = F'^{uj,s,YlZ'^) dsx¥-a.s. on [t,T] xA andY^ is indis- 
tinguishable from on [t,T] x A. 
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Proof. We have that 

f [fHu;,u,Y^_,zI) - F\iv,u,Yt,Zl)]du 

J]t,T] 

(5.3) 

+ / [Zl-Zl]dMu = Qi-Q2. 

J]t,T] 

As before, let 6Y = — Y'^, and omit the uj and t arguments of F as 
imphcit. 

We first seek to construct an process (fft and an M^^^ process pt such 
that the hnear BSDE with these components matches (5.3). As F^ does not 
depend on Z, we shah see that the a term in this BSDE is zero. Again, 
consider first those components, represented by Cj, where 

enF\Yl,Zl)-F\Yl,Z})]>0. 

For these rows, define 

e*cPt = e*[F\Yl,Z^) - F\yI,Z?)] > 0, 

(5.4) 

e*f3t = 0. 
For the other rows, first define 

e*<Pt = e,[FHYl,Z^) - F\yI,Z?)] > 0. 
As we are in the situation considered in assumption (iii), either 

\em--Yl]\>e\\Yl-Yll 

in which case define 

(5 5) ^.,_ enFHYl,Z?)-FHYl,Zf)] 
or there is a j with 

e*[Yl-Yl]<-e\\Yl-Yl\\, 

in which case, define 

.... e*[F\Yl,Z?)-F\Yl,Zf)] , 
(5.6) e,/3, = e,. 

In both cases, F is Lipschitz continuous, and so the matrix f3t is bounded. 
We have constructed (3t such that all off diagonal entries are are either zero 
[as in (5.4) and (5.5)], or are the ratio of two negative quantities [as in (5.6)], 
and will therefore be nonnegative. 
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We now note that, as assumption (iv) states, does not depend on Z, 
we have 

{Yl ,Z})- F' {Yl , = F^ [Yl ,0) - F' [Y^ , 0) 

= 0. 

We again, as in Theorems 4.2 and 5.3, piece together these random vari- 
ables (j)t and /3t into a pair of predictable processes. As shown above, the 
process /3 is bounded and has nonnegative quantities off the main diagonal, 
the process (j) is nonnegative by construction and Elj^^j,^ \\(j)t\\^ dt] < +oo as 
F is standard. 

The process 5Y then satisfies the linear BSDE, 

5Yt - [ [cPt + l3t6Yu-] du+ [ \Zl - Zl] dMu = Qi - Q2 > 0. 

J]t,T] J]t,T] 

As /3t has nonnegative entries off the main diagonal and aj = 0, the con- 
ditions of Corollary 3.20 are satisfied. The result follows. □ 

This theorem is counterintuitive, and when examined closely would ap- 
pear to create a contradiction. The only resolution to this is the following 
corollary. 

Corollary 5.8. For a pair of BSDEs satisfying Theorem 5.7, the strict 
comparison must hold componentwise. That is, if for some t and some Ci, 
on some A e Ft we have e*y/ = e*y/, then e*Q^ = e*Q'^ F-a.s. on A, 
e*F^{uj,s,Y^,Z^) = e*F^{uj,s,Y^^,Z^) ds x F-a.s. on [t,T] x A and e*Y^ 
is indistinguishable from e*Y'^ on [t,T] x A. 

Proof. Without loss of generality, we shall assume that i = 1, t = 0, 
A = fl. Then the assumptions of the theorem state c^Yq = e^Y^, and from 
Theorem 5.7, Y^ > Y^ P-a.s. for all s > 0. These assumptions immediately 
imply that, for small s, the process e^Yg must be P-a.s. nondecreasing. 

We can then rewrite (5.3) as a forward SDE for dY = Y^ —Y'^, with initial 
time sq, omitting the w, s and Z arguments of F as implicit or irrelevant, 

(5.7) 6Ys = 6Ys, - [ [F\Y^_) - F^iY^_)] du+ f [Zl - Zl\ dM^. 

By assumption (iii) of Theorem 5.7, and the fact Y} > Y^, unless 
\el[Y,l-Yl]\>e\\Y}^-Yl\\ 
for some e > 0, we must have 

elFHY}_)>elFHYl) 



BSDES ON MARKOV CHAINS 



33 



dt X P-a.s. This implies that, dt x P-a.s., 

(5.8) - elF\Y}_) + elF\Yl) < -elF^Yl) + elF^Vl), 

and by assumption (ii) of Theorem 5.7, this last term is nonpositive. 

Now let s* = mi{s:el5Ys ^0} AT. Then as 5Y is cadlag, eldY^*- = 0. 
We wish to show s* = T. 

For s = s* , the fact 6Y > and e^dYs*^ = implies, by assumption (iii) 
of Theorem 5.7, either that: 

1. 6Y,,^ = 
or that 

2. = \el[Y,\_ - Y,l_]\ < e\\Y,\_ - Y,l_\\ 

for all e > 0, and therefore (5.8) holds. 

In the first case, we can use the strict comparison of Theorem 5.7 to show 
that 6Ys = for all s>s*. Therefore s* = T. 

In the second case, for simplicity of notation, for s < 0, we define, Y^ = Yq, 
Z\ = 0, F^{uj, s, Z,Y) = for i = 1, 2. 

Suppose s* < T, so s* = mi{s : el6Ys 7^ 0}. For small As,e > 0, we can 
rewrite (5.7) as 

el6Ys,+As = el6Ys*.s - el[F\Y},_^) - F\Y,1_M^s + s* - (s* - 5) V 0) 
+ 0((e + As)2)+ / el[Z',-Zl]dM^. 

J]s*-£,s*+As] 

Note for s* > and < e < s*. 

As + s* - (s* - e) V = As + e. 

We know from (5.8) that the term -el[F^{Y},_^) - F"^ {Y^* _^)]{As + s* - 
(s* — e) V 0) is nonincreasing in As, and therefore, as Jj^..^ s*+As] ^li^u ~ 
Z^] dMu is a martingale in As, 

-e\[F\Y}._^) - F\Y^,_M^s + s* - (s* - e) V 0) 

+ / el[Zi-Zl]dM^ 

J]s*-e,s*+As] 

cannot be P-a.s. nondecreasing in As except if it is zero. 
However, this implies that for all sufficiently small As, 

e\5Ys*+i:,s = e\5Ys*^e = ^- 

This contradicts our assumption s* = inf{s : e\6Ys 7^ 0}, and hence s* = T in 
the second case. 

Therefore, we must have e\5Y = constant. In other words, we must have 
g*jyi _ y2j _ is, the strict comparison holds componentwise. □ 
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5.1. A vector example. 



Example 5.1. To demonstrate the conclusion of Theorem 5.7, we con- 
sider the following example. Consider two BSDEs with the same driver, 

"O O" 



Fiuj,t,Yt.,Zt) = f{u;,t) + 



1 1 



for a left continuous, dtxF square integrable, process / = [/i,/2]*. Sup- 
pose that > P-a.s. and that the terminal time T is deterministic. Un- 
der these conditions. Theorem 5.7 should hold. Here the difference Y^ — 
satisfies the equation 



Y/ - ¥/ 





"o o" 


1 

J]t,T] 


1 1 



- Y^_) du+ f [Zi - Zl] dMu = Q^-Q^ 

J]t,T] 



Our results on linear BSDEs, along with the fact that 



n. 





1 1 



du 



exp 





1 1 

t-s 



it 



1) 





1 1 



implies 



Yt'-Y,^ = E 



I+{e 



T-t 





1 1 



1 







{Q'-Q') 

E[Q'-Q^\Tt]. 



It is then clear that the first component of Y^ — will be simply the 
conditional expectation. If this reaches zero for some t, as > P-a.s., 
then the first component of — must be almost surely zero given 
On the other hand, the second component is a positive sum of the two 
components of — Q^. If this equals zero for some t, then both components 
of — must be almost surely zero given Tf 

Conversely, if we had 



F{co,t,Yt.,Zt) = fiu;,t) + 







-1 



Y. 



Theorem 5.7 would not apply. In this case we would find 



Y,' - Y,^ 



+ 1 







If then the second component of — has small conditional expectation 
compared with the first, for any t <T we see that the second component of 
Y^ — is negative. 
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Remark 5.9. A possible economic interpretation of the third assump- 
tion of Theorem 5.7 is the foUowing. Suppose that Yj represents a vector of 
prices. If one asset has an increased price, that should not, ceteris paribus, 
lead to other assets increasing in price. Assuming that the economy is closed 
this assumption may be reasonable, as investors must make a decision as to 
how to partition their portfolios, and an increase in one partition should not 
increase other partitions. 

6. i^-expectations. One interpretation of the solution to a BSDE is as 
a type of generalized expectation. In particular, for a fixed standard driver, 
F, and Q € M^, an J-j measurable, square integrable random variable, we 
can define the conditional F-evaluation to be 

(6.1) £s,m=Ys 

for s <t where Yg is the solution to 

Ys- I F{oj,u,Yu-,Zu)du+ I ZudMu = Q. 

J]s,t] J]s,t] 

Definition 6.1. Following [15], we shall call a system of operators 
£s,t:L\Ft)^L\Fs), 0<s<t<T, 

an J^i-consistent nonlinear evaluation for {Qs,t C L^{J-t)\0 < s <t < T} de- 
fined on [0,T] if it satisfies the following properties: 

1. For Q,Q'eQs,t, 

£sAQ)>SsAQ') 

P-a.s. componentwise whenever Q >Q' P-a.s. componentwise, with equal- 
ity iff Q = Q' P-a.s.; 

2. 5t,t(Q) = Q P-a.s.; 

3. £r,s{£s,t{Q)) = £r,t{Q) P-a.s. for any r < s<t; 

4. For any AeTs, IaSsAQ) = IaSsA^aQ) P-a.s. 

We are here allowing a significant generalization of [15], as our evaluations 
can all be vector valued, provided they are square integrable. This general- 
ization allows the use of these evaluations in multi-objective problems, where 
a scalar evaluation process may be insufficient. 

We also only require these properties to hold on some subset Qs^t of the set 
of square integrable terminal conditions, a distinction which shall become 
important when we consider arbitrage and finite market modelling. Note, 
however, that £s,t is defined over the whole of L'^{J-t), but it is only on Q* 
that property 1 holds. Furthermore, we add the restriction that ii Q > Q' 
P-a.s., Q,Q' G Qt, then £,AQ) = SsAQ') ^-a-s. iS Q = Q' P-a.s. 
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Remark 6.1. If <fs,t is an J-j-consistent nonlinear evaluation for Qs^t, it 
is also an J"(-consistent nonlinear evaluation for any subset of Qs,t- 

Definition 6.2. We often wish for the sets {Qs,t} to be stable through 
time. That is, £ will be called dynamically monotone for {Qs,t} if and only 
if, for all r < s <t: 

(i) £ is an J^t consistent nonlinear evaluation for {Qs,t}', 

(ii) Qr,t Q Qs,t {Qs,t is nondecreasing in s); 

(iii) £s,'t{Q) G 'Qr,s for ah Q G Qr,f 

Remark 6.2. In an economic context, where £s^t represents the price 
at s of an asset to be sold at time t, dynamic monotonicity corresponds, in 
some sense, to the statements: 

(ii) "An asset with an arbitrage free price, when bought at a time r and 
sold at time t, should also have an arbitrage free price when bought at any 
time s following r (with s prior to t)" and 

(iii) "An asset with an arbitrage free price, when the asset is bought at 
a time r and sold at time t, should be arbitrage free when sold at any time 
s prior to t (with s following r)." 

Theorem 6.3. Fix a driver F. Consider a collection of sets {Qs,t C 
L'^{J-'t)} with Qr^t ^ Qs,t for all r < s <t. Suppose that, for any Q^,Q^ € 
Qs,t, at least one of Theorems 5.1, 5.3 and 5.7 holds on [s,t], with 
F^ = F"^ = F whenever > P-a.s. Then £f^ defined in (6.1) is an Tt- 
consistent nonlinear evaluation for {Qs,t}- 

Proof. 1. The statement £^^{Qi) > £ff{Q2) P-a.s. whenever Qi > Q2 
P-a.s. is simply the main result of each comparison theorem, one of which 
holds by assumption. The strict comparison then establishes the second 
statement. 

2. The fact £ff{Q) = Q P-a.s. for any Tt measurable Q is trivial, as we have 
defined £f^{Q) by the solution to a BSDE, which reaches its terminal 
value Q at time t by construction. 

3. For any r < s <t, let Y denote the solution to the relevant BSDE. Then 
we have 



Q = Yr- / F{uj, u, Yu- ,Zu)du+ Zu dMu, 

J]r,t] J]r,t] 

which implies 

Ys = Yr- I F{oj,u,Yu-,Zu)du+ j Z^dM^. 

J\r.s\ J]r.s\ 
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Hence is also the time r value of a solution to the BSDE with terminal 
time s and value Yg. Hence 

C«t(Q))=C(Q) 

desired. 

4. We wish to show that for A € Tg, IaSsAQ) = ^a^sA^aQ) P-a.s. Write 
^s,t{Q) a-s the solution to the BSDE with terminal value Q, and £s,t{lAQ) 
as the solution to the BSDE with terminal value IaQ- Premultiply- 
ing these BSDEs by Ia gives two BSDEs, both with terminal value 
IaQ-, driver F{io,t,Yt, Zt) = lAF{io,t,Yt, Zt) and solutions lA£s,t{Q) and 
lA£s,t{lAQ), respectively. From Theorem 1.1, the solution to this BSDE 
is unique, hence lA£s,t{Q) = lA£s,t{lAQ) up to indistinguishability. □ 

Remark 6.4. As we have seen in Section 4.1, even in the scalar case, the 
assumption that a comparison theorem holds, and hence Sf^ is a nonlinear 
evaluation on Qs,t, is nontrivial. 

Definition 6.3. Let {Qs,t C L^(J't)} be a family of sets which are non- 
decreasing in s. For a BSDE driver F^, suppose that, for all s < t < T, 
assumption (iii) [and assumption (iv), when applicable], of at least one of 
Theorems 4.2, 5.1, 5.3 and 5.7 hold on ]s,t] for all Q^,Q^ € Qs,t [whether or 
not assumption (i) holds]. Then F^ is said to be a balanced driver on {Qs,t}- 

Remark 6.5. The logic of this name is due to the geometry of the 
problem, as, in some sense, F^ here balances the outcomes with zero hedging 
within the range of outcomes with hedging. 

Lemma 6.6. Let F be a balanced driver on {Qs,t} where Qs^t is nonde- 
creasing in s. Then £^ is an Tt consistent nonlinear evaluation on {Qs,t}- 

Proof. The requirements for F to be balanced are stronger than those 
needed for Theorem 6.3, and so the result follows. □ 

Lemma 6.7. Fix a driver F balanced on {Qs,t} where Qs.t is nonde- 
creasing in s. Then there exists a family of sets, {Qs,t}, such that £^ is 
dynamically monotone for {Qs,t} md Qg^t Q Qs,t for all s <t <T. 

Proof. Just as the proof of Theorem 6.3 (point 3), it is easy to see that 
£^ is recursive, that is, 

£rU£^,tm = £^AQ)- 

Define 



Qs,t = {£t^uiQ)\Q € Qs,u for some u>t}. 
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As F is balanced on {Qs,t}, assumptions (iii) and (iv) of the relevant 
comparison theorem must hold on ]s,u\ for all Q € Qs^u- As £^ is recursive, 
it follows that assumptions (iii) and (iv) of the comparison theorem must 
hold on ]s,t] for all Q S Qs,t, that is, F is balanced on {Qs^t}- It follows from 
Lemma 6.3 that £^ is an consistent nonlinear evaluation on {Qs,t}- 

As Qs,t is nondecreasing in s, it is clear that Qs^t is nondecreasing in s. 
Also, as £fi{Q) = Q, it is clear that Qs^t ^ Qs,t for all s < t. 

Finally, for any Q S Qr,t, there exists a u>t with Q = £f^{Q') for some 
Q' S Qr,u- Therefore, by recursivity, 

£f,tiQ)=£UQ')^Qr,s. □ 

Definition 6.4. Again following [15], we shall call a system of opera- 
tors, 

£{-\Tt):L\TT)^L\Tt), 0<t<T, 

an J-t-consistent nonlinear expectation for {Qt CL^{J-t)} defined on [0,r] 
if it satisfies the following properties. 

1. For Q,Q'eQt, 

£iQ\J't)>£{Q'\J't) 

P-a.s. componentwise whenever Q >Q' P-a.s. componentwise, with equal- 
ity iff Q = g' P-a.s. 

2. £{Q\Tt) = Q P-a.s. for any J't measurable Q. 

3. £{£{Q\Tt)\Ts) = £{Q\Ts) P-a.s. for any s<t. 

4. For any A G Ji, lA£{Q\J't) = £{lAQ\J't) P-a.s. 

Remark 6.8. It is clear that any nonlinear expectation is also a non- 
linear evaluation, with £s,t{^) = £{'\^s) for all s <t, and hence the concept 
of dynamic monotonicity extends to this new setting. On the other hand, 
as the terminal time is irrelevant in this context, one can simply specify the 
sets Qt = Qf , as the expectation refers without loss of generality to the ter- 
minal values at time T. Therefore, we say £{-\J-t) is dynamically monotone 
for {Qt} if, for ah s<t: 

(i) £ is an Ft consistent nonlinear expectation for {Qt}', 

(ii) Qs ^Qt {Qt is nondecreasing in i); 

(iii) £{Q\Ft) G Qs for ah Q G Q,. 

Theorem 6.9. Fix a driver F such that F{uj,t,Yt^,0) = dtx P-a.s. 
Consider a family of sets, {Qt C L'^{Ft)}, such that for any Q, Q' € Qt with 
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Q > Q' , cit least one of Theorems 5.1, 5.3 and 5.7 holds on ]t,T] with 
= F"^ = F . The functional £^ {■\J-t) defined for each t by 

(6.2) S''iQ\J^t) = Yt, 

where Yt is the solution to 

Yt - [ F{io,u,Yu-,Zu)du+ [ ZudMu = Q 

J]t,T] J]t,T] 
is an J^t- consistent nonlinear expectation for {Qt}- 

Proof. Properties 1 and 3 follow exactly as in the proof of Theorem 
6.3. 

Property 2 follows because we know that for t < T, if Q is Ft measurable 
then the solution of 

Yt - I F{io,u,Yu-,Zu)du+ / ZudMu = Q 

J]t,T] J]t,T] 

has Zu = d{M, M) u X P-a.s. (Simply take an Ft conditional expectation 
as done earlier.) Hence for t < u < T, F{uj,u,Yu-, Zu) = 0, and therefore 
Yt = Q P-a.s., as desired. 

For property 3, we know that lA£{Q\Ft) is the solution to a BSDE 
with terminal value IaQ and driver lAF{uj,t,Y, Z). As F{uj,t,Yt-,0) = 0, 
lAF{uj,t,Y,Z) = F{u},t,Y,lAZ) P-a.s. We also know £{lAQ\Ft) is the so- 
lution to a BSDE with driver F, and taking an Ft conditional expectation 
shows that the solution Z process for £{lAQ\Ft) satisfies Z = IaZ. Hence 
the two quantities solve the same BSDE, and so by the uniqueness of The- 
orem 1.1 must be equal. □ 

Corollary 6.10. Fix a driver F balanced on {Qt} where Qt is nonde- 
creasing in t and F{uj,t,Yt-,0) = dtx P-a.s. Then there exists a family of 
sets {Qt} such that £^ is dynamically monotone for {Qt} and Qt ^ Qt for 
all s<t<T. 

Proof. This follows exactly as in Lemmas 6.6 and 6.7 and Theorem 
6.9. □ 

Theorem 6.11. For any {Qt}, there exists a nonlinear F -expectation 
on {Qt}. 

Proof. Simply take F = Q. Then the nonlinear expectation corresponds 
to the classical expectation (and is balanced on {Qt}), and immediately 
satisfies all the desired properties. □ 



40 



S. N. COHEN AND R. J. ELLIOTT 



7. Dominance, arbitrage and i^-expectations. In this section we shall 
explore in detail the behavior of £^ given different sets {Qs,t\- We wish to 
discuss the relationship between F-expectations, arbitrage and assumption 
(iii) of Theorem 4.2. This will highlight how this model can be used to un- 
derstand phenomena which may be lost under a model driven by Brownian 
motion and the importance of considering nonlinear expectations only for 
subsets {Qs,t C Lp'{Ft)}- Throughout this section, we will consider only a 
single, fixed driver F, and all vector inequalities are to be read component- 
wise. 

In this context we need to distinguish between the closely related concepts 
of arbitrage and dominance. This is essentially because our "pricing rule," 
as defined by the solutions to a BSDE, is nonlinear. 

We begin with definitions of dominance and arbitrage in this context. 

Definition 7.1. Fix an initial time s. Consider a terminal time t>s 
and a pair of terminal values Q^,Q^ G L'^{J-t). Let and Y'^ a pair of 
corresponding evaluation processes. If, for some A & J-g, we have > 
and < Yg both P-a.s. on A, and at least one of these inequalities is strict 
with positive probability on A, then we shall say that dominates at 
s, under this evaluation, given A. 

Definition 7.2. Fix an initial time s. Consider a terminal time t > s 
and terminal value Q G L?'{J-t). Let Y be the the corresponding evaluation 
process. If, for some A^ Tg, we have Q > and Ys < 0, both P-a.s. on A, 
and at least one of these inequalities is strict with positive probability on 
A, then we shall say that Q is an arbitrage opportunity at s, under this 
evaluation, given A. 

Definition 7.3. We shall say that a set Qs^t C L'^{J^t) allows domi- 
nance if there exists Q^^Q^ € Qs^t with dominating at s. Similarly 
for arbitrage. 

Remark 7.1. If the evaluation of the terminal value Q = is Yg = 
P-a.s. on A, then an arbitrage opportunity is simply a strategy which 
dominates the zero strategy. 

This distinction and terminology has a simple interpretation when we 
think of our evaluations as prices. An "arbitrage" opportunity is any situa- 
tion where, for no positive outlay today, a positive outcome can be ensured 
in the future. On the other hand, for an asset to dominate another, is to 
state a strict preference relation between the two, as the better option in 
the future is at least as cheap today. (Examples of this price interpretation 
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of BSDE solutions, for example, to option pricing under constraints, can be 
found in [7].) 

It is significant in this context that these situations are not scale- invariant. 
This means that there may be a situation where there is a small arbitrage 
opportunity, but the nonlinearity of the pricing rule means that this cannot 
be replicated significantly. 

When F is linear and Q^,* is a vector space, as is implicitly assumed in a 
Black-Scholes market model, the existence of dominance and the existence of 
arbitrage are equivalent. In our context, we need to maintain the distinction 
between them, particularly due to the nonlinearity of F. 

Theorem 7.2. For any J-t consistent, nonlinear evaluation for {Qs^t} 
(in the sense of Definition 6.1), the sets Qs^t do not allow dominance under 
this evaluation. 

Proof. This is simply the first property of nonlinear evaluations. □ 

Corollary 7.3. Let F be a balanced driver on {Qs,t}- Then the sets 
Qs^t do not allow dominance, under the pricing rule given by the BSDE 
solutions with driver F. 

Proof. As F is balanced, by Lemma 6.6 we know that such an F defines 
an J^t consistent nonlinear evaluation for {Qs,t}- D 

Theorem 7.4. For any Ft consistent nonlinear expectation for {Qt} 
with € Qj for all t, the sets Qt do not allow either arbitrage or dominance 
under this nonlinear expectation. 

Proof. These are simply the first and second properties of nonlinear 
expectations, and an application of Remark 7.1. □ 

Corollary 7.5. Consider a driver F satisfying F(a;,i,0,0) = 0, dtxF- 
a.s. on [0,r]. Then if F is a balanced driver on some sets {Qt} with G Qt 
for all t, the sets Qt do not allow arbitrage under the evaluation given by 

Proof. We know that such an F defines a nonlinear expectation on Q, 
by Corollary 6.10. □ 

As we have seen in Section 4.1, there exist examples of BSDE solutions 
in this context where arbitrage opportunities do exist. This is a significant 
distinction between these models based on Markov chains and models based 
on Brownian motions. 



42 



S. N. COHEN AND R. J. ELLIOTT 



Remark 7.6. For given market data, we may ask when the data are 
consistent with arbitrage free BSDE pricing. Let Qo,t denote the time t 
values of all possible tradable assets in the market, including combinations 
of assets. Our question is then equivalent to asking if there exists a driver F 
such that there is a nonlinear evaluation S^ti-) for {Qo,t} which maps each 
terminal condition to its current price. 

If there is a unique F that will do this, we have a complete, arbitrage free, 
BSDE pricing scheme. This does not guarantee that the BSDE solutions for 
terminal conditions not in {Qo,t} will be valid, arbitrage free prices. 

Remark 7.7. Under BSDE pricing with Markov chains, we may have 
arbitrage free pricing for those assets in the market, but not for other assets 
we wish to consider. If we were to hypothetically introduce other assets into 
the market, while retaining no arbitrage, this could result in changing the 
driver F, and hence the dynamics of all stocks in the market. 

8. Geometry. Different assumptions on {Qs,t} and F imply different ge- 
ometric results on the values of Yt. For simplicity, we here restrict our at- 
tention to the scalar {K = 1) case. 

Definition 8.1. For any s, we define Hs{Q) to be the essential convex 
hull of Q at time s to be the smallest J-g measurable convex set such that 
P{QeH,{Q)\Fs) = l. 

Definition 8.2. We define r .i. i/^ (Q) to be the relative interior of Hs{Q), 
that is, the interior of Hs{Q) viewed as a subset of the affine hull it generates. 

Remark 8.1. The interested reader is referred to any good book on 
elementary stochastic finance for a more detailed definition (e.g., [11], page 
27 or [10], page 65). 

Theorem 8.2. Consider a scalar (K = 1), nonlinear F- expectation £^ {■\Tt) 
for {Qt}- Suppose, for all q gM., g € Qq. Then for all Q G Qq, F-a.s., 

£^{Q\To)er.lHo{Q). 

Proof. If Hq{Q) is singular, that is, it contains only a single point, then 
Q is J-Q measurable (up to equality P-a.s.), and so Yq = Q P-a.s., and the 
condition is trivial. 

Otherwise, we first wish to show that £^{Q\J^q) > inf Hq{Q). Let 

Q"^"^ = inf//o(Q). 

As this is J-Q measurable, the solution to the BSDE with driver F and 
terminal condition Q™™ is simply Yq = Q™™. If Hq{Q) is unbounded below. 
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Qmm _ ^YiQ statement is trivial. Otherwise, Q™™ G M, and so Q™"-' G 

Q. No dominance in Q then imphes that 

We can then repeat this argument with Q™^^ = sup Hq{Q), to show that 
£^{Q\To) <supHq{Q). Hence £^{Q\To) lies strictly within the interior of 
HoiQ), which, as we are dealing with nonsingular, scalar valued Q, corre- 
sponds precisely with the set r.i. i/o(Q) as desired. □ 

Remark 8.3. Clearly, this result can be extended to show £^{Q\J^t) G 
r.i. Ht{Q). However, care must be taken as there is no guarantee that Q™™ := 
miHt{Q) G L^(J't), even though Q™™ is Ft measurable. 

To show this general result, the existence theorem must be strengthened 
to show that if E[Q'^\J^t\ < +00 P-a.s., then there exists a solution {Y,Z) to 
the BSDE (1.3) on [t,T]. With this extension, we can weaken our definition 
of Qt to be 

Qt C {Q : Q G L^iTT),E[Q^\Tt] < +00, P-a.s.}. 
We then assume that L^{Tt) C Q^, and the result follows. 

9. Other properties of and applications to risk measures. One use 

of BSDEs which has developed recently is as a framework for developing 
dynamically consistent, convex or coherent risk measures. This can be seen 
in [2], and more generally in [16]. We here present the key results in the 
context of BSDEs driven by Markov chains. 

As noted in [15], the theory of nonlinear expectations provides an ideal 
setting for the discussion of risk measures in the sense of [1] and others. We 
shall consider dynamic risk measures pf of the form 

p!{Q):=-£It{Q), 

where £^ is a nonlinear F-evaluation for some {Qs,t C L^(J-'t)}. As before, 
our approach allows a considerable generalization of earlier results, as the 
quantities considered can be vector valued. 

An alternative specification, used in [2] and [16], is to let pf {Q) = £frp(—Q), 
the solution to a BSDE with terminal condition —Q. Under this alternative 
specification, the following results remain valid, however, the result of The- 
orems 9.7 holds for p with F convex rather than concave. 

We seek to determine properties of pf , or, equivalently, of Sfj, which are 
particularly relevant in the context of risk measures. 

The first four properties below follow directly from the properties of non- 
linear evaluations, recalling the proofs of Theorems 6.3 and 6.9, with Qt = 
where appropriate. 
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Lemma 9.1 (Terminal equality). For any F, pf satisfies the terminal 
condition p^{Q) = —Q- 



Proof. Equivalent to property 2 of nonlinear evaluations. □ 



Theorem 9.2 (Dynamic consistency). For any F, pf satisfies the re- 
cursivity and dynamic consistency requirements, 



PsiQ)=pfi-pfm 



and 



p(iQ)=pf{Q') pf{Q)=pf(Q'), 



for times s <t <T, all equalities being ¥-a.s. 

Proof. The first statement is equivalent to property 3 of nonlinear 
evaluations. The second follows by uniqueness of solutions to BSDEs with 
given terminal conditions, as in Theorem 1.1. □ 

Lemma 9.3 (Constants). Let Q be ¥-a.s. equal to a Ft measurable ter- 
minal condition. Then if F is a driver with normalization F{Lj,t,Q,0) =0 
dt X P-a.s., pf (Q) = -Q for all t. 

Proof. Equivalent to property 4 of nonlinear expectations. □ 

Theorem 9.4 (Monotonicity). Let F be a balanced driver for {Qj,t}- 
Then for any Q^,Q'^ G Qt,T with > F-a.s., we have pf {Q^) < pf{Q'^), 
with equality if and only if = Q^, P-a.s. 

Proof. Equivalent to property 1 of nonlinear evaluations. □ 

Lemma 9.5 (Translation invariance). Let F be a driver for a BSDE with 
normalization ^(a;, u, 0) = P x du-a.s., on ]t,T]. Suppose F does not 
depend on Yu-- Then for any q € L'^{Tt), 



p[{Q + q) = p[iQ)-q, 
£t,T{Q + <i) = £t,T{Q) + i- 



Proof. As F does not depend on we know that 




implies 




The result follows. □ 
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Theorem 9.6 (Positive homogeneity). Let F be a positively homoge- 
neous driver, that is, for any A > 0, if (Y, Z) is the solution corresponding 
to some Q, P x du-a.s. on ]t,T], 

F{uj,u,XYu-,XZu) = XF{u},u,Yu-,Zu). 
Then for all such Q, pf (XQ) = Xpf (Q) and £fj,(XQ) = X£fj,[Q). 

Proof. Simply multiply the BSDE with terminal condition Q through 
by A. □ 

Theorem 9.7 (Convexity/concavity). Suppose Qt^T is a convex set. Let 
F he a concave balanced driver for Qt^T, that is, for any X € [0,1], any 
(y^, Z^), (y^, Z^) the solutions corresponding to Q^,Q^ G Qt,T, P x du-a.s. 
on]t,T], 

F{u;, u, Ayi_ + (1 - X)Y^_,XZl + (1 - X)Zl) 

> XF{u:, u, yj_ , + (1 - x)Fioj, u, y„2_ , zl). 

Then for any X € [0, 1] and any Q^,Q^ G Qt,T, 

pfiXQ' + (1 - A)Q2) < Apf (Qi) + (1 - X)pf{Q% 
£It{^Q^ + (1 - A)Q') > X£It{Q^) + (1 - X)£It{Q^). 

Proof. Taking a convex combination of the BSDEs with terminal con- 
ditions and gives the equation 

Ay/ + (1 - X)Y^ - [ [XF{iv,u,Y^_,Zu) + (1 - X)F{oo,u,Y;^_,Zl)]du 

J]t,T] 

+ [ [AZ^ + (1 - A)Z2] dM„ = AQi + (1 - A)Q2, 

J]t,T] 

which is a BSDE with terminal condition XQ^ — (1 — X)Q^ and driver 

F = XF{iv, u, yj_ , zj + (1 - x)F{iv, u, yj„ , zl). 

We next consider the BSDE with terminal condition XQ^ + (1 ~ ^)Q^ and 
driver F. Denote the solution to this by Y^. We can compare these BSDEs 
using the relevant comparison theorem — the first assumption of the theorem 
is trivial, the second is satisfied by the convexity of F, and the remainder 
because F is balanced on Qt^r- Hence, our solutions satisfy 

y^ > Ay^ + (1 - A)y2 

with equality if and only if the terminal conditions are equal with conditional 
probability one. The inequality for Sfj. follows. The inequality for p[ can 

then be established by noting that as £frp[Q) is concave in Q, it follows that 

pf {Q) = —£fj.{Q) is convex in Q. □ 
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10. Conclusions. BSDEs with an underlying finite state Markov chain as 
a stochastic process form a viable alternative to BSDEs based on Brownian 
motions and Brownian motions with Poisson jumps. We have given closed- 
form solutions to linear BSDEs on Markov chains, and then have used these 
to develop a comparison theorem for the solutions. 

These BSDEs allow for very delicate modelling of no arbitrage condi- 
tions. In particular, they allow no arbitrage only to be required on a subset 
of all possible terminal conditions, but the pricing rule to allow arbitrage 
outside this set. This is in contrast to BSDEs based on Brownian motion, 
where the comparison theorem guarantees no arbitrage in all cases. We have 
drawn a connection between the exclusion of arbitrage and the presence of a 
balanced driver F, a condition which can be understood as an infinitesimal 
no-arbitrage condition. We have also discussed the geometry of these BSDEs 
and the relation to no arbitrage conditions. 

Finally, we have shown key properties which can be used when develop- 
ing nonlinear evaluations and risk measures from these BSDEs. The added 
requirements of the theorem help us to distinguish precisely what is needed 
for each of these properties to hold which may prove useful when considering 
other models. 

APPENDIX: EXAMPLE PREDICTABLE QUADRATIC 
COVARIATION MATRIX 

For clarity, an example of is presented here for = 4. Suppose Xt- = 
[0,0, 1,0]*, AiXt- = [01,02, -oi - 02 - 04,04]*. Then 
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